Learning Spark by Jules S. Damji
Author:Jules S. Damji
Language: eng
Format: epub
Publisher: O'Reilly Media
Published: 2020-06-28T00:00:00+00:00
The deterministic nature of the tasks ensures that the output data is the same no matter how many times the task is reexecuted. This crucial characteristic enables Spark Streaming to provide end-to-end exactly-once processing guarantees, that is, the generated output results will be such that every input record was processed exactly once.
This efficient fault tolerance does come at the cost of latency—the micro-batch model cannot achieve millisecond-level latencies; it usually achieves latencies of a few seconds (as low as half a second in some cases). However, we have observed that for an overwhelming majority of stream processing use cases, the benefits of micro-batch processing outweigh the drawback of second-scale latencies. This is because most streaming pipelines have at least one of the following characteristics:
The pipeline does not need latencies lower than a few seconds. For example, when the streaming output is only going to be read by hourly jobs, it is not useful to generate output with subsecond latencies.
Download
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Modelling of Convective Heat and Mass Transfer in Rotating Flows by Igor V. Shevchuk(6412)
Weapons of Math Destruction by Cathy O'Neil(6234)
Factfulness: Ten Reasons We're Wrong About the World – and Why Things Are Better Than You Think by Hans Rosling(4719)
A Mind For Numbers: How to Excel at Math and Science (Even If You Flunked Algebra) by Barbara Oakley(3281)
Descartes' Error by Antonio Damasio(3256)
Factfulness_Ten Reasons We're Wrong About the World_and Why Things Are Better Than You Think by Hans Rosling(3219)
TCP IP by Todd Lammle(3164)
Fooled by Randomness: The Hidden Role of Chance in Life and in the Markets by Nassim Nicholas Taleb(3085)
Applied Predictive Modeling by Max Kuhn & Kjell Johnson(3047)
The Tyranny of Metrics by Jerry Z. Muller(3038)
The Book of Numbers by Peter Bentley(2946)
The Great Unknown by Marcus du Sautoy(2671)
Once Upon an Algorithm by Martin Erwig(2631)
Easy Algebra Step-by-Step by Sandra Luna McCune(2609)
Lady Luck by Kristen Ashley(2563)
Police Exams Prep 2018-2019 by Kaplan Test Prep(2522)
Practical Guide To Principal Component Methods in R (Multivariate Analysis Book 2) by Alboukadel Kassambara(2520)
All Things Reconsidered by Bill Thompson III(2375)
Linear Time-Invariant Systems, Behaviors and Modules by Ulrich Oberst & Martin Scheicher & Ingrid Scheicher(2350)